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Description 

[0001] The present invention relates to the use of a polynucleotide sequence encoding Cathepsin Y or an amino 
acid sequence of Cathepsin Y protein for the characterisation or identification of therapeutic agents for stroke, the use 

5 of such sequences for the development of a medicament, and agents useful as medicaments for the treatment of stroke. 
[0002] Stroke is a rather common and potentially harmful event that often leaves patients severely disabled for the 
rest of their lives. It is the result of either an interruption of blood and therefore oxygen supply to the brain, or bleeding 
in the brain that occurs when a blood vessel bursts. The reduced oxygen supply leads to shortage of energy in the 
affected brain regions and results in the death of neurons around the lesion. The degeneration spreads from the site 

10 that is affected by the reduced blood supply into regions which have at all times obtained sufficient oxygen. 

[0003] Stroke results from a loss of blood flow to the brain caused by thrombosis or haemorrhage. With an incidence 
of 250-400 in 100.000 and a mortality rate of around 30%, stroke is a major public health problem. About one-half of 
the stroke survivors suffer from significant persisting neurological impairment and/or physical disability. Thus, the eco- 
nomic costs of stroke amount to many billions of dollars worldwide. 

15 [0004] A stroke occurs when the blood supply to part of the brain is suddenly interrupted or when a blood vessel in 
the brain bursts. As a consequence, brain cells die when they no longer receive oxygen and nutrients from the blood 
or when they are damaged by sudden bleeding into the brain. Some brain cells die immediately after interruption of 
the blood flow into the brain, while others remain at risk for death and stay in a compromised state for hours. These 
damaged cells make up the so-called "ischemic penumbra", and with timely treatment these cells could be saved. 

20 Stroke ultimately leads to infarction, the death of huge numbers of brain cells, which are eventually replaced by a fluid- 
filled cavity (or infarct) in the injured brain. 

[0005] There are two major forms of stroke: ischemic - blockage of a blood vessel supplying the brain, and hemor- 
rhagic - bleeding into or around the brain. An ischemic stroke can be caused by a blood clot (embolus or thrombus), 
which is blocking a vessel. It can also be caused by the narrowing of an artery due to the build-up of plaque (a mixture 

25 of fatty substances, including cholesterol and other lipids). The underlying pathological process called stenosis is often 
observed in arteriosclerosis, the most common blood vessel disease. About 80% of all strokes are ischemic strokes. 
A hemorrhagic stroke is caused by the bursting of an artery in the brain. Subsequently, blood spews out into the 
surrounding tissue and upsets not only the blood supply but also the delicate chemical balance neurons require to 
function. Hemorrhagic strokes account for approximately 20% of all strokes. 

30 [0006] A transient ischemic attack (TIA), sometimes called a mini-stroke, starts just like a stroke but then resolves 
leaving no noticeable symptoms or deficits. The occurrence of a TIA is a warning that the person is at risk for a more 
serious and debilitating stroke. About one-third of patients who have a TIA will have an acute stroke sometime in the 
future. The addition of other risk factors compounds a person's risk for a recurrent stroke. The average duration of a 
TIA is a few minutes. For almost all TIAs, the symptoms go away within an hour. There is no possibility to distinguish 

35 whether symptoms will be just a TIA or persist and lead to death or disability. 

[0007] Recurrent stroke is frequent; about 25 percent of people who recover from their first stroke will have another 
stroke within 5 years. Recurrent stroke is a major contributor to stroke disability and death, with the risk of severe 
disability or death from stroke increasing with each stroke recurrence. The risk of a recurrent stroke is greatest right 
after a stroke, with the risk decreasing with time. About 3 percent of stroke patients will have another stroke within 30 

40 days of their first stroke and one-third of recurrent strokes take place within 2 years of the first stroke. 

[0008] The most important risk factors for stroke are hypertension, arteriosclerosis, heart disease, diabetes, and 
cigarette smoking. Others include heavy alcohol consumption, high blood cholesterol levels, illicit drug use, and genetic 
or congenital conditions, particularly vascular abnormalities. People with multiple risk factors compound the destructive 
effects of these risk factors and create an overall risk greater than the simple cumulative effect of the individual risk 

45 factors. 

[0009] Although stroke is a disease of the brain, it can affect the entire body. Depending on the affected brain region 
and the severity of the attack, post-stroke patients suffer from a variety of different symptoms. Some of the disabilities 
that can result from stroke include paralysis, cognitive deficits, speech problems, emotional difficulties, daily living 
problems, and pain. Stroke disability is devastating to the stroke patient and family, but therapies are available to help 
50 rehabilitate post-stroke patients. The mortality rate observed in ischemic stroke is around 30%. The time window for 
a medical treatment is narrow and limited to anticoagulants and thrombolytic agents, which must be given immediately 
(at latest 3 hours) after having a stroke. Unfortunately, there is no effective neuroprotective medication available, which 
is able to stop the delayed degeneration of neurons following the initial stroke attack. 

[0010] Stroke symptoms appear suddenly. The following acute symptoms can be observed: Sudden numbness of 
55 the face, arm or leg, difficulties in talking or understanding speech, trouble seeing in one or both eyes, sudden trouble 
in walking, loss of balance and coordination. Severe headache with no known cause does also occur. Even more 
importantly, there is a variety of severe disabilities occurring and persisting in post-stroke patients. Paralysis is a fre- 
quent disability resulting from stroke. Cognitive deficits (problems with thinking, awareness, attention and learning) are 
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also commonly observed. Post-stroke patients exhibit language deficits and also emotional deficits (like post-stroke 
depression). Furthermore, an uncommon type of pain, called central pain syndrome (CPS), can occur after having a 
stroke. 

[0011] Currently the only effective treatment for thrombotic stroke is the use of anticoagulants (e.g. heparin), and 
5 thromobolytics (recombinant tissue plasminogen activator). Neuroprotective agents, which are effective in animal mod- 
els, have generally proved ineffective in the clinic, and none are yet registered for use in stroke. 
[0012] A number of experimental models have been developed for global ischemia and focal ischemia. The availa- 
bility of these different models provides an opportunity to investigate mechanisms of stroke. Finding common features 
in different models or over several time points within one model should provide better insight into the mechanisms 
10 critical for stroke, and comparison of the models should help to understand development, progression and consequenc- 
es of stroke. 

[0013] Most common global ischemia models are 

a) the two vessel model 

15 Models of transient global ischemia resulting in patterns of selective neuronal vulnerability are models that 

attempt to mimic the pathophysiology of cardiac arrest or hemodynamic conditions that result from severe systemic 
hypotension. Reversible high-grade forebrain ischemia is generated by bilateral common carotid artery (CCA) 
occlusion. Together with systemic hypotension conditions it reduces the blood flow to severe ischemic levels (Smith 
et al. 1984, Acta Neuropathol. 64; p31 9-332). This model of transient global ischemia has the advantage of a one 

20 stage surgical preparation, the production of a high-grade forebrain ischemia and the possibility to conduct chronic 

survival studies in order to assess the potential of neuroprotective drugs. 

b) the four vessel model 

This model results in a high-grade forebrain ischemia but is produced in two stages, one to manipulate each 
25 of the CCA, and the second stage 24h later to produce the forebrain ischemia. The advantage is that the second 

step can be produced in awake freely moving animals (Pulsinelli et al. 1982, Ann. Neurol. 11; 491-498). Similar 
pathohistological results could be obtained for both vessel models. 

c) the cardiac arrest model 

30 Forebrain ischemia models are of value to study cerebral ischemia but these models do not exactly mimic the 

hemodynamic consequences of a cardiac arrest, which results in a complete ischemia of the brain, spinal cord 
and extracerebral organs (Katz et al. 1995; J. Cereb. Blood Flow Metab. 15; 1032-1039). The initial cardiac arrest 
models from Safar et al. (1 982, Protection of tissue against hypoxia; Elsevier Biomedical Press; 1 47-1 70 ) or Ko- 
rpaczew et al. (1982, Partol Fizjol Eksp Ter 3; 78-80), were developed further by Katz et al. (1989, Resuscitation 

35 17; 39-53) and Pluta et al. (1991, Acta Neuropathologica 83; 1-11) to models with controllable insult. Katz and 

colleagues (1995) have reported a reproducible outcome model of cardiac arrest with apneic asphyxia of 8 min, 
leading to the cessation of circulation at 3-4 min of apnea and resulting in cardiac arrest of 4-5 min. At 72 hr after 
injury, widespread patterns of ischemic neurons were found in many brain regions, including cerebral cortex, cau- 
date putamen CA1 and CA3 regions of hippocampus, thalamus, cerebellum and brain stem. 

40 Pluta et al. described a primary mechanical cardiac arrest model whereby global ischemia was induced by cardiac 

arrest for 3 to 10 min with survival periods of the animals from 3 min to 7 days. 

Although these models have several limitations, they provide a method for studying the mechanisms of neuronal 
injury resulting from the clinically realistic cerebral insult and screening potential cerebral resuscitation therapies. 

45 Focal Ischemia models 

[001 4] Models of permanent or transient focal ischemia typically giving rise to localised brain infarction have routinely 
been used to investigate the pathophysiology of stroke. For example, models of middle cerebral artery (MCA) occlusion 
in a variety of species have gained increased acceptance due to their relevance to the human clinical setting. 

50 

a) the permanent MCA Occlusion models 

[0015] Tamura and colleagues (1981 . J. Cereb. Blood Flow Metab. 1 ; 53-60) developed a subtemporal approach as 
standard model of proximal MCA occlusion. In models of permanent MCA occlusion, electrocauterisation of the MCA 
55 proximal to the origin of the lateral lenticulostriate arteries is utilised routinely. In these models, severe reductions in 
blood flow are seen within the ischemic core, with milder reductions in blood flow within the border or penumbral regions. 
The addition of moderate arterial hypotension has the effect of enlarging infarct volume. 
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b) the transient MCA Occlusion models 

[0016] In human ischemic stroke, recirculation frequently occurs after focal ischemia. Thus, models of transient MCA 
occlusions have also been developed, mainly in rats or mice, whereby surgical clip or sutures are introduced to induce 
5 a transient ischemic insult. 

[0017] Some further animal models for stroke are considered in several reviews like the articles of W.D. Dietrich 
(1998; Int. Review of Neurobiology, Vol.42; 55-101) , Wiebers et al. (1990 Stroke 21; 1-3) orZivin and Grotta (1990 
Stroke 21; 981-983). 

[0018] Object of the present invention is to provide a target for examination and development of a medicament for 
10 the treatment of stroke. 

[0019] This object is met by a method for the development of a medicament for the treatment of stroke comprising 
the use of a polynucleotide sequence encoding Cathepsin Y or homologues or fragments thereof or the according 
polypeptide. 

[0020] According to the invention it has been found that Cathepsin Y is differentially expressed under stroke, panic- 
's ularly under ischemic stroke. In the present invention it is shown that in the rat models Cathepsin Y expression is 
upregulated under stroke, particularly under global ischemia in the cardiac arrest model (Pluta et al, 1991 Acta Neu- 
ropathol. 83; 1-11). 

[0021] The model is explained in more detail in the literature and in the examples below. 

[0022] The term "polynucleotide sequence" or "nucleic acid sequence" designates in the present application any 
20 DNA or RNA sequence, independent of the length. Thus this term can describe short sequences like PCR primers or 
probes for hybridisation, as well as whole genes orcDNA of these genes. 

[0023] The term "polypeptide" or "amino acid sequence" designates a chain of amino acids, independent of their 
length, however, in any case more than one amino acid. 

[0024] As "homologues" of polynucleotide sequences such polynucleotide sequences are designated which encode 

25 the same type of protein as one of the polynucleotide sequences described herein. Accordingly as "homologues" of a 
polypeptide the polypeptides are designated, which have an amino acid sequence, wherein at least 70 %, preferably 
80 %, more preferably 90 % of the amino acids are identical to one of the proteins of the present invention and wherein 
the replaced amino acids preferably are replaced by homologous amino acids. As "homologous" amino acids are 
designated those, which have similar features concerning hydrophobicity. charge, steric features etc.. Most preferred 

30 are amino acid sequences, containing the species- or family-dependent differences of the amino acid sequence. Par- 
ticularly as "homologues" sequences are designated which correspond to one of the cited sequences in another species 
or individual. For example if in the present invention a rat model is used and the cited polynucleotide sequence encodes 
the rat protein, the according polynucleotide sequence and protein of a mouse in a mouse model is designated as 
"homologue". Further splice variants and members of gene families are designated as homologues. 

35 [0025] "Fragments" of a polynucleotide sequence are all polynucleotide sequences, which have at least 1 0 identical 
base pairs compared to one of the polynucleotide sequences shown in the present application or by the genes repre- 
sented by these polynucleotide sequences. The term "fragment" encloses therefore such fragments as primers for 
PCR, probes for hybridisation, DNA fragments included in DNA vectors like plasmids, cosmids, BACs or viral constructs, 
as well as shortened splice variants of the genes identified herein. As a fragment of a protein (polypeptide) amino acid 

40 sequences are designated which have at least three amino acids, preferably at least 10 amino acids. Therefore frag- 
ments serving as antigens or epitopes are enclosed in this designation. 

[0026] In the present application the term "sequence" is used when either a polynucleotide sequence (= nucleic acid 
sequence) or a polypeptide (= amino acid sequence) or a protein is meant. That means, when it is irrelevant which 
type of sequence is used the type is not designated particularly, but with the more common term "sequence". 
45 [0027] In the present application the term "stroke" means the development, occurence, progression and consequenc- 
es of the disease state. Several features of the development, occurance and consequences of this disease are de- 
scribed herein above. 

[0028] The basis of the methods and assays described in the present application is the examination of Cathepsin Y, 
which is differentially expressed under stroke. For the examination necessary for the development of any medicament 

50 each sequence can be used which allows the determination of the expression rate of the Cathepsin Y gene or the 
activity of the polypeptide / protein. The considered sequence can be the polynucleotide sequences SEQ ID NO. 1, 
SEQ ID NO. 2 or SEQ ID NO. 4. respectively or homologues or fragments thereof, as well as the polypeptides encoded 
thereby (SEQ ID NO. 3 or SEQ ID NO. 5, respectively) or homologues or fragments thereof. 
[0029] According to the invention it has been found out, that the Cathepsin Y gene represented by the polynucleotide 

55 sequences SEQ ID No: 2 is differentially expressed in the cardiac arrest models for stroke, which is described above. 
[0030] Therefore the present invention provides a target for the development of medicaments for the treatment of 
stroke, particularly for the treatment of ischemic stroke. Cathepsin Y has not yet been regarded in relation to stroke. 
[0031] JP20001 57263 describes a human Cathepsin Y useful in the elucidation and therapy for Alzheimer's disease, 
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neurodegenerative diseases, cancers and the like. Comparing the nucleic acid sequence shown in this publication with 
the sequences of common databases, however, shows that the provided sequence corresponds to Cathepsin F, not 
to Cathepsin Y. 

[0032] WO 96/39194 teaches that the Cathepsin Y protein is involved in the secretion of p-amyloid peptide (p-AP) 
5 from cells, resulting in the development of Alzheimer's disease. The nucleic acid sequence as well as the protein 
sequence of Cathepsin Y is provided. For the treatment of Alzheimer's disease the inhibition of this protein is proposed, 
therefore inhibitors of Cathepsin Y are examined. 

[0033] According to the present invention the Cathepsin Y sequences (polynucleotide sequence or polypeptide) can 
be used in any test system which allows the determination of either the expression rate of the gene or the activity of 

10 the polynucleotide sequence or of the polypeptide / protein. 

[0034] For determination and comparison of the expression levels of at least one of the sequences identified in the 
present invention any of the commonly known methods can be used, either on RNA/cDNA level or on protein level. 
For example PCR, hybridisation, microarray based methods, western blot or 2-D protein gel analysis are suitable 
methods. One preferred method is the digital expression pattern display method (DEPD method), explained in detail 

15 in WO99/42610. The method used for determination of expression levels is not restrictive, as long as expressed 
amounts can be quantified. 

[0035] Activity of the Cathepsin Y protein can be determined by contacting the protein with polypeptides under acid 
pH conditions and determining the concentration of free amino acids after cleavage. Activity of the Cathepsin Y protein 
is a carboxypeptidase activity. Methods for determination of Cathepsin Y activity are explained in detail in international 
20 patent application WO 96/391 94. 

[0036] Expression and activity of Cathepsin Y can further be determined in assay systems or models. Such assay 
systems may be in vivo, ex vivo or in vitro assays, for example a cellular assay system comprising cells expressing 
Cathepsin Y, or an assay comprising isolated Cathepsin Y 

[0037] In any assay or model at least one of the sequences is contacted with the compound(s) to be tested and 
25 samples are obtained, wherein expression levels or activity of the sequences are determined and compared to non- 
treatment conditions. 

[0038] For examination of the expression rate of the gene, animal models can be used. As such a model any animal 
can be used wherein the necessary preparations can be carried out, however, mammalian models are preferred. Even 
more preferred are rodents, particularly preferred are rats and mice. The most preferred animal model of the present 
30 invention is a rat model. 

[0039] Dependent of the model used, the samples can be derived from whole blood, cerebrospinal fluid (CSF) or 
whole tissue, from cell populations isolated from tissue, cerebrospinal fluid (CSF) or blood or from single cell populations 
(i.e. cell lines). 

[0040] In one embodiment of the invention cellular assays can be used. Preferred cells for cellular assays are eu- 

35 karyotic cells, more preferably mammalian cells. Most preferred are neuronal-like cells, like SHSY5Y (neuroblastoma 
cell line), hippocampal murine HT-22 cells, primary cultures from astrocytes, cerebral cortical neuronal-astrocytic co- 
cultures, mixed neuronal/glial hippocampal cultures, cerebellar granular neuronal cell cultures, primary neuronal cul- 
tures derived from rat cortex (E15-17). or COS cells (African green monkey, kidney cells); CHO cells (Chinese hamster 
ovary), or HEK-293 cells (human embryonic kidney). 

40 [0041 ] The Cathepsin Y sequences of the present invention can be used for diagnosing stroke, particularly ischemic 
stroke of a human outside of the living body, by determining the expression levels or activity of Cathepsin Y sequences 
in comparison to the non-disease status. During treatment period of a patient the expression or activity of the sequences 
can also be used for assessing the efficacy of stroke treatment outside of the body. In these cases blood, CSF or tissue 
is removed from the patient and expression or activity is determined in the samples. 

45 [0042] The disease status, which is considered in the present invention, is stroke. The preferred types of stroke are 
ischemic and hemorrhagic stroke. Further diseases, which might be related to stroke, are severe headache, paralysis, 
cognitive deficits, speech problems, emotional difficulties, daily living problems, and central pain syndrome. 
[0043] Independent whether the efficacy of stroke treatment or the efficiency of a compound for the treatment of 
stroke shall be examined or a stroke status is diagnosed or characterised, determination of the expression level or the 

50 activity of Cathepsin Y is carried out outside of a living body. As mentioned above, detection of the expression of 
Cathepsin Y can be carried out by any method known in the art. The method of detection is not limiting the invention. 
[0044] Expression levels can be detected either on basis of a polynucleotide sequence or by detecting the according 
polypeptide, encoded by said polynucleotide sequence. 

[0045] Preferred methods for detection and determination of the gene expression levels are PCR of cDNA, generated 
55 by reverse transcription of expressed mRNA, hybridisation of polynucleotides (Northern-, Southern Blot systems, In 
situ hybridisation), DNA-microarray based technologies, detection of the according peptides or protein via e.g. Western 
Blot systems, 2-dimensional gel analysis, protein microarray based technologies or quantitative assays like e.g. ELISA 
tests. 
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[0046] The most preferred method for quantitative analysis of the expression levels is the differential expression 
pattern display method (DEPD), described in detail in WO99/42610. 

[0047] By using any of the Cathepsin Y sequences (polynucleotide sequence or polypeptide / protein) the efficiency 
of compounds for reducing the expression of the sequence or the activity of the protein can be tested. Therefore the 
5 sequences of the present invention can be used for identifying therapeutic agents and their efficacy for the treatment 
of stroke. 

[0048] For example a method can be used comprising: 

a) providing a test cell population comprising cells capable of expressing the Cathepsin Y gene(s) or homologues 
10 or fragments thereof 

b) contacting said test cell population with the test therapeutic agent, 

c) detecting the expression of the Cathepsin Y gene(s) in said test cell population, 

d) comparing the expression of the gene(s) in the test cell population to the expression of the gene(s) in a reference- 
cell population whose disease stage is known; and 

15 e) identifying a difference in expression levels of the considered sequences, if present, in the test cell population 

and the reference cell population, thereby identifying a therapeutic agent for treating stroke. 

[0049] For this method an animal model can be used or test cells can be obtained from a subject, an animal model 
or cell cultures of fresh cells or cell lines. Further in vitro assays may be used. 

20 [0050] Agents or compounds, which have any influence on the expression rate of the Cathepsin Y gene(s) might be 
used as a medicament for the treatment of stroke. A method examining the expression rate of the Cathepsin Y gene 
can therefore be used for testing agents and compounds for their efficiency for treatment of stroke. Which model is 
used is not relevant, as long as the model allows determining differences in expression amounts. 
[0051] In such a model cells are contacted with the interesting agent or compound, and expression of Cathepsin Y 

25 gene is determined in comparison to the expression in cells which have never been contacted by the according agent 
/compound. Contacting the cells with the agent / compound can either be achieved by administering the agent / com- 
pound to an animal, or to isolated cells of tissue or blood, or cell lines in culture. 

[0052] By examination of the influence the considered agent / compound has to the expression of the Cathepsin Y 

gene the efficacy of the agent / compound for treating stroke can be estimated. This allows the decision whether it is 
30 worthwhile to develop a medicament containing such an agent or compound for the treatment of stroke. 

[0053] Whether the expression is determined on the basis of mRNA generation (transcription level) or on basis of 

protein generation (translation level) is not relevant, as long as the difference of the expression rate can be determined. 

Therefore the polynucleotide sequence, as well as the polypeptide or protein shown in the present application can be 

used for the development of a medicament. 
35 [0054] The development of a medicament can be desirable for example if the considered compound /agent has any 

influence on the regulation of the expression rate or on the activity of Cathepsin Y. 

[0055] Said influence of a compound or agent can be examined by a method comprising contacting a sample com- 
prising the nucleic acid sequence or homologues or fragment thereof or the according polypeptides of the present 
invention with a compound that binds to said sequence in an amount sufficient to determine whether said compound 
40 modulates the expression rate or the activity of the polynucleotide or polypeptide sequence. 

[0056] By such a method a compound or agent can be determined modulating the expression rate or activity of the 
nucleic acid sequence or homologues or fragments thereof or the according polypeptides. 

[0057] A method for determination of the efficiency of a compound / agent for the treatment of stroke by determination 
of the activity of the Cathepsin Y protein comprises for example contacting the Cathepsin Y protein with a compound 
45 /agent in presence of a known polypeptide which is a target of the Cathepsin Y, determining the amino-terminal amino 
acid of the peptide or the amount of free amino acids in the sample after incubation, comparing the amino-terminal 
amino acid of the peptide or the amount of free amino acids with the result in a sample which doesn't contain the 
compound /agent. 

[0058] If a compound / agent is able to decrease or inhibit the activity of the Cathepsin Y protein it might be useful 
50 as a medicament for the treatment of stroke. 

[0059] Examples for compounds, which can decrease Cathepsin Y activity, are compounds of formula I: 
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R 2 



O R 3 
ii I 

CNR'CH— 



-R 4 formula I 



- 1 n 

10 wherein: 

R is selected from the group consisting of hydrogen, alkyl of from 1 to 6 carbon atoms, and where R and R2 are 
joined to form a ring structure of from 4 to 1 0 carbon atoms, 

R' is selected from the group consisting hydrogen, alkyl of from 1 to 6 carbon atoms and where R' and R 3 are 

15 joined to form a ring structure of from 4 to 1 0 carbon atoms, 

R 1 is selected from the group consisting of alkyl of from 1 to 4 carbon atoms substituted with from 1 to 5 substituents 
selected from the group consisting of (a) aryl of from 6 to 10 carbon atoms, (b) aryl of from 6 to 10 carbon atoms 
substituted with 1 to 3 substituents selected from the group consisting of alkyl of from 1 to 6 carbon atoms, aryl of 
from 6 to 10 carbon atoms, alkoxy of from 1 to 6 carbon atoms, aryloxy of from 6 to 10 carbon atoms, hydroxy, 

20 cyano, halo and amino, (c) cycloalkyl of from 3 to 8 carbon atoms and (d) heterocycles of from 3 to 14 carbon 

atoms having from 1 to 3 heteroatoms selected from the group consisting of nitrogen, oxygen and sulfur wherein 
said substituted alkyl group is optionally further substituted with from 1 to 2 hydroxyl groups, alkenyl of from 2 to 
4 carbon atoms substituted with from 1 to 4 substituents selected from the group consisting of (a) aryl of from 6 
to 10 carbon atoms, (b) aryl of from 6 to 10 carbon atoms substituted with 1 to 3 substituents selected from the 

25 group consisting of alkyl of from 1 to 6 carbon atoms, aryl of from 6 to 10 carbon atoms, alkoxy of from 1 to 6 

carbon atoms, aryloxy of from 6 to 10 carbon atoms, hydroxy, cyano, halo and amino, (c) cycloalkyl of from 3 to 8 
carbon atoms and (d) heterocycles of from 3 to 14 carbon atoms having from 1 to 3 heteroatoms selected from 
the group consisting of nitrogen, oxygen and sulfur aryl of from 6 to 10 carbon atoms, aryl of from 6 to 10 carbon 
atoms substituted with 1 to 3 substituents selected from the group consisting of alkyl of from 1 to 6 carbon atoms, 

30 aryl of from 6 to 10 carbon atoms, alkoxy of from 1 to 6 carbon atoms, aryloxy of from 6 to 10 carbon atoms, 

hydroxy, cyano. halo and amino, fluorenyl, heterocycles of from 3 to 14 carbon atoms having from 1 to 3 heter- 
oatoms selected from the group consisting of nitrogen, oxygen and sulfur; 

R 2 and R 3 are independently D- or L-amino acid side chains of at least 2 carbon atoms with the proviso that said 

amino acid side chains do not include the proline side chain; 
35 R4 j s selected from the group consisting of -C(0)CH =N=N, -CH 2 OH : -C=NOH : and -C(0)R 5 where R 5 is hydrogen, 

alkyl of from 1 to 6 carbon atoms, haloalkyl of from 1 to 6 carbon atoms and 1 to 2 halo groups, alkoxy of from 1 

to 6 carbon atoms, -NR 6 R 7 where R 6 and R 7 are independently selected from the group consisting of hydrogen 

and alkyl of from 1 to 6 carbon atoms, and aryl of from 6 to 10 carbon atoms, and -N(CH 3 )OCH 3 ; 

X is selected from the group consisting of -0-, -NR 9 -, and -S- where R 9 is selected from the group consisting of 
40 hydrogen, alkyl of from 1 to 6 carbon atoms and aryl of from 6 to 1 0 carbon atoms; 

Y is selected from the group consisting of -C(O)- and -C(S)-; 

m is equal to zero or one; and 

n is equal to zero, one or two, 

45 or pharmaceutical^ acceptable salts thereof 

with the proviso that when R 1 is 1 -naphthyl R 2 is -CH(CH 3 ) 2 (L-isomer), R 3 is -CH 2 -0 (L-isomer), Y is -C(O)-, m is zero 
and n is one, then R 4 is not -N(CH 3 )OCH 3 , with the further proviso that when R' is diphenylmethyl, 
R 2 is p-(benzyloxy)benzyl (L-isomer), Y is -C(O)-, and m and n are zero, then 

R 4 is not -N(CH 3 )OCH 3 , and with still the further proviso that when R 1 is (1 ,2diphenyl)ethenyl, Y is -C(O)-, R 2 is -CH 2 - 
so 0 (L-isomer), and m and n are zero, then R 4 is not -N((H 3 )OCH 3 . 
[0060] In a preferred embodiment 

R 1 includes benzyl, trityl, diphenylmethyl, 4phenylbutyl : 2-phenylethyl, naphthyl, pyridyl, fluorenyl, xanthanilyl, and the 
like. 

R 2 and R 3 are independent side chains of a D- or L- amino acid having at least 2 carbon atoms with the proviso that 
55 R 2 and R 3 are not proline. Such side chains refer to the R 8 substituent found on naturally occurring and synthetic amino 
acids of the formula H 2 NCHR 8 COOH. Side chains of naturally occurring amino acids include, by way of example only, 
those where R 8 is the L-isomer of (CH 3 ) 2 CH- (valine), (CH 3 ) 2 CHCH 2 (leucine), CH 3 CH 2 CH(CH 3 )-(isoleucine), 0CH 2 - 
(phenylalanine), (3-indolyl)-CH 2 - (tryptophan), CH 3 SCH 2 CH 2 - (methionine), CH 3 CH(OH) (threonine), p-HO-0-CH 2 - 
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(tyrosine), H 2 NC(0)CH 2 - (asparagine), H 2 NC(0)CH 2 CH 2 - (glutamine), HOC(0)CH 2 - (aspartic acid), HOC(0)CH 2 CH 2 - 
(glutamic acid), H 2 NCH 2 CH 2 CH 2 CH 2 - (lysine), H 2 NC(NH)NHCH 2 CH 2 CH 2 - (arginine), 4-imidazolyl-CH 2 - (histidine) 
and the like. 

[0061 ] Side chains of synthetic amino acids include the D-isomer of the above noted naturally occurring amino acids 
5 as well as those where R 8 is selected from the group consisting of alkyl of from 2 to 6 carbon atoms (where the alkyl 
group does not occur in naturally occurring amino acids), cycloalkyl of from 3 to 8 carbon atoms, and alkyl of from 1 
to 6 carbon atoms substituted with from 1 to 2 substituents selected from the group consisting of aryl of from 6 to 1 0 
carbon atoms, 

aryl of from 6 to 1 0 carbon atoms substituted with from 1 to 3 substituents selected from the group consisting of alkyl 
10 of from 1 to 6 carbon atoms, alkoxy of from 1 to 6 carbon atoms, aryl of from 6 to 1 0 carbon atoms, and aryloxy of from 
6 to 10 carbon atoms, and heteroaryl of from 3 to 14 carbon atoms having from 1 to 3 heteroatoms selected from the 
group consisting of nitrogen, oxygen and sulfur (where the substituted alkyl group does not occur in naturally occurring 
amino acids). 

[0062] Particularly preferred amino acid side chains include the D- and L-isomers of valine, leucine, phenylalanine, 
tryptophan and isoleucine. 
[0063] R4 preferably is -CH=N=N or -C(0)H. 
[0064] X preferably is -0-. 
[0065] Y preferably is -C(O)-. 
[0066] m and n preferably are 0 or 1 . 
[0067] In the present application -0 means a phenyl residue. 

[0068] Preferred compounds include, by way of example, the following compounds as defined by formula II below, 
including all isomers thereof, wherein the amino acid side chain for R 2 and R 3 is indicated beneath the R 2 and R 3 
substituent: 



15 



20 



25 



30 



R z 

i 

RifXJm-Y-NK^H- 



O R 
it I 

CNHCH 



-R 



-» n 



formula II 



35 



R 


•x 


m 


Y 


R 2 


n 


R 3 


R 4 


0-CH 2 - 


0 


1 


-C(O)- 


-CH(CH 3 ) 2 (valine) 


1 


-CH 2 -0 (phenylalanine) 


-C(0)CH=N=N 


0-CH 2 - 


0 


1 


-C(O)- 


-CH 2 -0 (phenylalanine) 


1 


-CH 2 -0 (phenylalani ne) 


-C(0)CH=N=N 


(0) 2 -CH- 




0 


-C(O)- 


-CH 2 -0 (phenylalanine) 


0 




-C(0)-H 


0-(CH 2 ) 4 - 




0 


-C(O)- 


-CH 2 -0 (phenylalanine) 


0 




-C(0)-H 


(0) 3 -C- 




0 


-C(O)- 


-CH 2 -0 (phenylalanine) 


0 




-C(0)-H 


(0) 2 -CH- 




0 


-C(O)- 


-CH 2 -CH 2 -0 
(homophenylalanine) 


0 




-C(0)-H 


0CH=C(0) 




0 


-C(O)- 


-CH 2 -0 (phenylalanine) 


0 




-C(Q)-H 


(0) 2 -CH- 




0 


-C(O)- 


-CH 2 -(3-indolyl) 


0 




-C(0)-H 


0-CH 2 - 


0 


1 


-C(O)- 


-CH 2 -0 (phenylalanine) 


1 


-CH(CH 3 ) 2 (valine) 


-C(0)-H 


0-CH 2 - 


0 


1 


-C(O)- 


-CH(CH 3 ) 2 (valine) 


1 


-CH 2 -CH(CH 3 ) 2 
(leucine) 


-C(0)-H 



40 



45 



50 



55 [0069] The term "heterocycles containing from 3 to 14 carbon atoms and 1 to 3 hetereoatoms selected from the 
group consisting of nitrogen, oxygen and sulfur" refers to saturated and unsaturated heterocyclic groups having the 
requisite number of carbon atoms and heteroatoms. Suitable heterocyclic groups include, by way of example, furazanyl, 
furyl, imidazolidinyl, imidazolyl, imidazolinyl, indolyl, isothiazolyl, isoxazolyl, morpholinyl (e.g. morpholino), oxazolyl, 
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piperazinyl (e.g. 1 -piperazinyl), piperidyl (e.g. 1 -piperidyl, piperidino), pyranyl, pyrazinyl, pyrazolidinyl, pyrazolinyl, pyra- 
zolyl, pyridazinyl, pyridyl, pyrimidinyl, pyrrolidinyl (e.g. 1-pyrrolidinyl), pyrrolinyl, pyrrolyl, quinoxalinyl, thiadiazolyl, thi- 
azolyl, thienyl, thiomorpholinyl (e.g. thiomorpholino), triazolyl, and xanthanilyl. 

[0070] If in the present application a chemical group is described as containing for example "from 1 to 6 carbon 
5 atoms" or "from 6 to 1 0 carbon atoms" or "from 1 to 4 heteroatoms" it is meant that the group can contain either 1 or 
2 or 3 or 4 or 5 or 6, or 6 or 7 or 8 or 9 or 1 0, or 1 or 2 or 3 or 4 of said atoms, respectively. 

[0071] Heterocyclic groups can be substituted or unsubstituted. Where the heterocyclic group is substituted, the 
substituents are selected from alkyl of from 1 to 6 carbon atoms, alkoxy of from 1 to 6 carbon atoms, aryl of from 6 to 
10 carbon atoms, aryloxy of from 6 to 10 carbon atoms, and halo. 
w [0072] Preferred heterocycles include well known cyclic aromatic groups containing heteroatoms within the cyclic 
structure. Such groups include, by way of example, furyl, imidazolyl, oxazolyl, pyrazolyl. pyridyl. pyrimidinyl, thiazolyl, 
and triazolyl. 

[0073] The term "alkyl" refers to straight and branched chain alkyl groups such as methyl, ethyl, n-propyL iso-propyl, 
n-butyl, tert-butyl, sec-butyl, isobutyl, n-pentyl, n-hexyl, 2-methylpentyL and the like; whereas the term "alkoxy" refers 
'5 to -0-alkyl substituents. 

[0074] The term "aryl" refers to aromatic substituents comprising carbon and hydrogen such as phenyl, naphthyl and 
the like whereas the term aryloxy refers to -0-aryl substituents where aryl is as defined above. 
[0075] The term "halo" or "halogen" refers to fluorine, chlorine, bromine and iodine and preferably fluorine and chlo- 
rine. 

20 [0076] The term "pharmaceutically acceptable salts" refers to the non-toxic alkali metal, alkaline earth metal, and 
ammonium salts commonly used in the pharmaceutical industry including the sodium, potassium, lithium, calcium, 
magnesium, barium, ammonium, and protamine zinc salts, which are prepared by methods well known in the art. The 
term also includes non-toxic acid addition salts, which are generally prepared by reacting the compounds of this in- 
vention with a suitable organic or inorganic acid. 

25 [0077] Representative salts include the hydrochloride, hydrobromide, sulfate, bisulfate, acetate, oxalate, valerate, 
oleate, laurate, borate, benzoate, lactate, phosphate, tosylate, citrate, maleate, fumarate, succinate, tartrate, and na- 
psylate salts, and the like. The particular salt employed is not critical. 

[0078] Particularly preferred embodiments of the compounds usable as a medicament for the treatment of stroke by 
inhibiting protein activity of Cathepsin Y are the compounds 1 to 13 as shown in the following: 

30 



35 



40 




Compound 1 
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Compound 1 1 




Compound 12 



Preparation of the compounds 1 to 12 of the present application is explained in detail in W096/39194. 
[0079] 




Compound 13 



0 0 



12 
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Preparation of compound 13 is explained in detail in the article of Therrien, C. etal., Biochemistry (2001 ) 40, 2702-2711 . 

[0080] Furthermore the sequences of the present invention itself can be used as a medicament. 

[0081 ] An example for such a use is the use of a polynucleotide sequence as an antisense agent. Antisense agents, 

5 including, but not limited to, ribonucleotide or desoxyribonucleotide oligomers, or base-modified oligomers like phos- 
phothioates, methylated nucleotides, or PNAs (peptide nucleic acids), can hybridise to DNA or mRNA, inhibiting or 
decreasing transcription or translation, respectively. Thus, polynucleotide sequences of a gene, which is increased in 
expression rate under stroke, can be used as antisense agents to decrease the expression rates of said gene. Further 
such polynucleotide sequences can be used for gene therapy. 

10 [0082] A pharmaceutical composition comprising a polynucleotide sequence according to the present invention can 
be any composition, which can serve as a pharmaceutical one. Salts or aids for stabilising the sequence in the com- 
position preferably are present. 

[0083] For the determination of the expression of Cathepsin Y the generated sequence has to be detected. Therefore 
several reagents can be used, which are for example specific radioactive or non-radioactive (e.g. biotinylated or fluo- 
15 rescent) probes to detect nucleic acid sequences by hybridisation, primer sets for the detection of one or several of 
the nucleic acid sequences by PCR, DNA microarrays, antibodies against one of the polypeptides, or epitopes, or 
antibody or protein microarrays. Such reagents can be combined in a kit, which can be sold for carrying out any of the 
described methods. 

[0084] For further examinations or experiments it might be desirable to include the nucleic acid sequence of the 
20 present invention into a vector or a host cell. By including the sequence in a host cell, for example cellular assays can 
be developed, wherein the gene, polynucleotide sequences and the according protein / polypeptides can be used or 
examined. 

[0085] Further the sequences defined in the present invention can be used to "design" new transgenic animals as 
models for stroke. Therefore the animals are "created" by manipulating the gene(s) encoding Cathepsin Y in a way 

25 that their expression in the transgenic animal differs from the expression of the same gene(s) in the wild type. "Manip- 
ulation" preferably results in a different expression level of the Cathepsin Y in the transgenic animal compared to the 
wild type, or in a protein with different enzymatic activity. Whether the expression level or the protein activity becomes 
higher or lower than the wild type under the same conditions depends on the desired model for stroke. Methods of 
gene manipulation and methods for the preparation of transgenic animals are commonly known to those skilled in the 

30 art. 

Figures: 

[0086] Figure 1 shows Cathepsin Y transcript differentially expressed over several time points in the cardiac arrest 
35 model. Cathepsin Y is examined in its expression level over a time period of 0.5 hrs after cardiac arrest to 2 years after 
surgery and compared to sham operated controls. Cathepsin Y is described as differentially expressed when the se- 
quence is up or down regulated at one or more time points with a certain statistical relevant significance value (Ah > 
±0.2). 

[0087] X-axis describes the time points analysed by DEPD, 1= 0.5h post operation, 2= 1 hr post operation, 3= 3 hrs 
40 post operation, 4= 6hrs post operation, 5=day 3 post operation, 6=day 7 post operation, 7= 2 years post operation. 
The Y-axis shows A h, which represents the normalised difference of expression (peak height) of Cathepsin Y between 
a control group and a treated group, x-fold difference in gene expression is calculated by 

1+Ah 

45 1-Ah 

[0088] 0= no change to control, + = up regulation, - - down regulation; (0,2 = 1 ,5 fold; 0,3 = 1 .86 fold; 0.4 = 2,33 
fold; 0,5 = 3 fold expression level). 

[0089] The following examples are provided for illustration and are not intended to limit the invention to the specific 
so example provided. 

Examples: 

Example 1 : Preparation of rat cardiac arrest model 

55 

[0090] Cardiac arrest was performed in female Lewis-rats at 3 months of age (1 50-220g), resulting in total cessation 
of blood flow leading to global cerebral ischemia. After 10 mm of ischemia, the animals were resuscitated by external 
heart massage and ventilation. The group size was 2x3 animals. 
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[0091] For induction of cardiac arrest, a special blunt-end, hook-like probing device was inserted into the right par- 
asternal line across the third intracostal spaces into the chest cavity. Next, the probe was gently pushed down the 
vertrebral column until a slight resistance from the presence of the right pulmonary veins was detected. The probe 
then was tilted 10-20° caudally. Then, the probe was rotated in a counter-clockwise direction about 135-140° under 

5 the inferior vena cava. At this position, the occluding part of the device was positioned right under the heart vessie 
bundle (inferior vena cava, superior right and left vena cava, ascending aorta, and pulmonary trunk). The pulmonary 
veins (left and right) were closed by the rotation of the occluding part of the hook. In the last step, the probe was pulled 
up with concomitant compression of the heart vessel bundle against the sternum. The end of the occluding portion of 
the hook was then positioned in the left parasternal line in the second intercostal space. To prevent upward movement 

10 of the chest and to insure complete ligation of the vessels, simple finger pressure was applied downward the sternum, 
producing total hemostasis and subsequent ventricular arrest. The effect of the whole procedure is total cessation of 
both arterial and venous blood flow, it essentially represents the onset of clinical death. After 2,5-3,5 min, the probe 
was released and removed from the chest by reverse procedural succession, and the animals remained in this position 
until the beginning of resuscitation. 

15 [0092] The resuscitation procedure consisted of external heart massage until spontaneous heart function was re- 
covered and controlled respiration occured. During this time, air was pumped through a polyethylene tube inserted 
intratracheally that was connected to a respirator. External heart massage was produced by the the index and middle 
fingers rapidly striking against the chest (sternum) for 1 -2 min at the level of the fourth intracostal area with a frequency 
of 150-240/min in continuous succession. The ratio of strikes to frequency of ventilation was 6:1 or 8:1 . 

20 [0093] An electrocardiogramm (lead ll-EEG) was recorded continuously during the course of the experiment. More- 
over, heart activity was monitored using a loud speaker connected to the output lead of the electroencephalograph. 
Additionally, the cranial bones were exposed at the sagittal and coronal sutures, where silver-needle electrodes were 
attached for recording on an electrocorticogramm (ECoG). All measurements were registered on a ten-channel elec- 
troencephalogramm (Accutrace-100A, Beckmann). 

25 [0094] 2x3 sham operated animals served as controls. These animals were treated similarly to the experimental 
group with one major exception. Under anesthesia, the probe was inserted through the chest wall into the plural cavity 
as has already been described above but without further manipulation and torsion of the probe. The probe remained 
in the chest for essentially the same time period (3,5 min) as in the experimental group. The control animals were then 
returned to their cages for recovery. 

30 [0095] Tissue preparation occurred 0.5h, 1h, 3hrs, 6hrs, 3 days, 7 days and 2 years after surgery. Tissues were 
frozen on liquid nitrogen prior to RNA preparation. 

Example 2: Determination of expression levels 

35 [0096] Gene expression profiling by DEPD-analysis starts with the isolation of 5-10u.g total RNA. In a second step, 
double-stranded cDNA is synthesized. Through an enzymatic digest of the cDNA with three different type IIS restriction 
enzymes, three pools with short DNA-fragments containing single-stranded overhangs are generated. Afterwards, 
specific DNA-adaptor-molecules are ligated and in two subsequent steps 3.072 PCR reactions are performed by using 
1 024 different unlabelled 5'primer and a common FAM fluorescent labelled 3'-primer in the last PCR step. Subsequently, 

40 the 3072 PCR pools are analysed on an automatic capillary electrophoresis sequencer. 

[0097] Differential gene expression pattern of single fragments are determined by comparison of normalized chro- 
matogram peaks from the control groups and corresponding operated animals. 

Example 3: Sequencing and Databank analysis of the obtained sequences 

45 

[0098] Differentially expressed peaks are confirmed on polyacrylamid gels by using radioactive labelled 3' primer 
instead of the FAM fluorescent primer. Differentially expressed bands are cut from the gel. After a short elution step in 
60^1 1 0mM Tris pH8, fragments are re-amplified by PCR using the same primer as used in the DEPD analysis. Resulting 
PCR products are treated with a mixture of Exonuclease I and shrimp alkaline phosphatase prior to direct sequencing. 
50 Sequencing reactions are performed by using a sequencing kit like DYE-namic ET dye terminator sequencing kit, and 
subsequently analysed by capillary electrophoresis (Megabace 1000, Amersham). 

[0099] Priorto a BLAST analysis (Altschul et al. 1997, Nucleic Acids Res. 25:3389-3402) against Genbank (Release 
126), sequence was quality verified and redundant sequences or repetitive motifs are masked. 
Results are shown in Table 1 and are compared to human sequence. 

55 



SEQ ID NO: 


name 


length 


1 


EST of Cathepsin Y (rat) 


255 bp 
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(continued) 



SEQ ID NO: 


name 


length 


2 


rattus norvegicus Cathepsin Y DNA 


1387 bp 


3 


rat Cathepsin Y protein 


306 aa 


4 


human Cathepsin Y DNA 


1500 bp 


5 


human Cathepsin Y protein 


303 aa 



10 

[0100] Expression of the Cathepsin Y gene in the stroke model over a time period of 0.5 h to 2 years is shown in 
figure 1. 

Example 4: Enzymatic activity of Cathepsin Y protein 

15 

[0101] Cathepsin Y does not have the standard endopeptidic activity associated with proteases. In an effort to identify 
whether the enzyme has other proteolytic activity, a number of randomly selected synthetic oligopeptides were incu- 
bated with purified Cathepsin Y at pH 5.5 or 4.5. Specifically, purified Cathepsin Y was incubated with the selected 
synthetic oligopeptides (50 u.g/ml) at either pH 4.5 or pH 5.5, for 1 hou r at 37°C. Samples were quenched by the addition 
20 of trifluoroacetic acid to 1% final concentration, then analysed by reverse phase HPLC on a Vydac C1 8 column, using 
a gradient of increasing acetonitrile in 0.1% trifluoroacetic acid. Individual peaks (parent and new product(s)) were 
collected, then analysed by acid hydrolysis followed by amino-acid analysis. 
[0102] The results are summarised below: 



Parent Sequence 


Product(s) 


LFYDQSPTATI 


LFYDQSPTAT(aa 1-10 of the parent sequence) 
LFYDQSPTA (aa 1 -9 of the parent sequence) 
LFYDQSPT (aa 1-8 of the parent sequence) 


YKRDMVGGVVIA 


YKRDMVGGWI (aa 1-11 of the parent sequence) 
YKRDMVGG VV (aa 1 -1 0 of the parent sequence) 


EGYYGNYGVYA 


EGYYGNYGVY (aa 1-10 of the parent sequence) 
EGYYGNYGV (aa 1 -9 of the parent sequence) 


FFDEPNPGVTIY 


FFDEPNPGVT (aa 1-10 of the parent sequence) 



[0103] The above peptides, as well as other peptides recited herein, are listed, per convention, from amino terminus 

40 to the carboxyl terminus. 

[01 04] The results from a number of such experiments (all not shown) suggest that the proteolytic activity of Cathepsin 
Y is a sequential removal of the carboxy terminal amino acids, which is direct evidence for carboxypeptidase activity. 
No evidence of any endopeptidase or aminopeptidase activity was seen with any of these substrates. These data 
strongly suggest that the predominant proteolytic activity manifested by Cathepsin Y is carboxypeptidase activity. 

45 [0105] The quantitative analysis of carboxypeptidase activity of Cathepsin Y is based on the detection of the new 
free ami no-terminus generated on cleavage of a selected substrate. This is accomplished by reacting with the reagent 
ophthalaldehyde in an alkaline solution in the presence of 2-mercaptoethanol (Simons, et al., JACS, 98:7098-7099, 
(1 976)) . The peptide EGYYGNYGV was synthesised acetylated on its amino-terminus so that on cleavage by Cathepsin 
Y, the only free amino-terminus present in the reaction mixture will be that of the valine residue, cleaved off by the 

so carboxypeptidase activity of the protease. 

[0106] A standard curve was constructed by incubating varying concentrations of valine (0-20 u.M) in 0.25 M sodium 
borate, pH 10, containing 0.05% 2-mercaptoethanol and 60 ng/ml o-phthalaldehyde, in individual wells of a 96 well 
microtiter plate. The resulting fluorescence is read in a plate reading Cytofluor (ex 340, em 460 nm). There is a linear 
increase in the signal proportional to the amount of free valine present. 

55 [0107] In order to determine the pH optimum of the enzyme, reaction mixtures were set up with enzyme and substrate 
(0.2mg/ml) in a total reaction volume of 0.1 ml) in individual wells of 96-well microtiter plates, in 20 mM sodium acetate 
buffers at different pHs, with 0.1 % 2-mercaptoethanol present. Control wells were set identically except for the presence 
of enzyme. At various timepoints after incubation at room temperature, the reactions were quenched by the addition 
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of an equal volume of 0.45 M sodium borate, pH 1 0, with 0.25 mg/ml o-phthalaldehyde, and the fluorescence measured. 
In the absence of added enzyme, no measurable fluorescence is generated above background, even with extended 
incubations. In the presence of enzyme, there is a time-dependent increase in fluorescence. Based on this analysis, 
the pH optimum for carboxypeptidase activity was determined to be about 4.5, with the activity dropping off at both 
lower and higher pHs. 

[0108] Using this assay at pH 4.5, the ability of a number of compounds to inhibit the activity of Cathepsin Y was 
tested, by adding the desired concentration of the inhibitor in the incubation mixture along with enzyme and substrate, 
and measuring the decrease in fluorescence (if any) relative to enzyme control. Each of the compounds 1 to 1 3 tested 
in this analysis indicated inhibition of Cathepsin Y activity. 
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<110> Biofrontera Pharmaceuticals AG 

<120> Cathepsin Y for the development of a medicament for the 
treatment of stroke 

<130> 10662EPalleSeq 

<140> 
<141> 

<160> 5 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 255 
<212> DNA 

<213> rattus norvegicus 
<400> 1 

gggaggagcg accgacacga actagaaatt atgagcacaa gtacactgga atcctccagc 60 
ttcagagctg cttcctccac ccacagacct gcttcctcct ccaccgtgcc cgagcaggcc 120 
taacctccag accgtcagag aggacagcta tggtctagga cagttctggt gttaccctgg 180 
agtccacggg aggggaacta gtccagactg cctgagatga gtaaagtatc tggcgtcacc 240 
aaaaaaaaaa aaaaa 255 

<210> 2 
<211> 1387 
<212> DNA 

<213> rattus norvegicus 
<400> 2 

ctcactttat ctggggtgcg gatcgggtca agaggttgaa ggtgctgtgc gtgatccagg 60 
atccaagttg gcccccggag caggagcatg gcgtcgtcgg gatcggtgca gcagctgcgg 120 
ctggtgctgc tgatgttgct gctggccggt gcggcacggg ccagcctcta cttccgcccg 180 
ggccagacct gctaccgacc ccttcacagg gaccacctgg ctctgctggg gcgcaggacc 240 
tatcctcggc cacatgagta cctgtcccca gcggatctcc ccaagaactg ggactggagg 300 
aatgtgaacg gtgtcaacta tgccagcgtc acgaggaatc agcatatccc acagtactgt 360 
ggttcttgct gggcccacgg cagcaccagt gccttggcag atcgcatcaa catcaagaga 420 
aaaggtgcat ggccctccac cctgttgtcg gtgcagaatg tcatcgactg cggcaatgct 480 
ggttcttgtg aggggggcaa tgaccttccg gtgtgggagt atgcccacaa gcatggcatc 540 
cccgatgaga cctgcaacaa ctaccaggcc aaggaccaag aatgtgacaa gtttaaccag 600 
tgtgggacct gcactgaatt caaagagtgc cacaccatcc agaattatac cctctggaga 660 
gtgggtgact atggctccct gtccgggagg gagaagatga tggcagagat ctacgccaat 720 
ggtcccatca gctgcgggat aatggcaacc gagaggatgt ctaactacac tggaggcatc 780 
tatactgagt accagaacca ggccattatc aaccacatca tctctgtagc cggctggggt 840 
gtcagcaatg atggcatcga gtattggatt gtccgcaatt catggggcga gccctggggt 900 
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gagcgaggct ggatgagaat tgtgaccagc 
aacctcgcca ttgaggaggc ctgcacattt 
ggaagcaaca ctgtgaacca tgatgaggag 
cagctagaaa cagtggccct catgtggaca 
aggtgatagg aaaggatgag ccacaactac 
cctccaccca cagacctgct tcctcctcca 
agagaggaca gctatggtct aggacagttc 
actagtccag actgcctgag atgagtaaag 
aaaaaaa 

<210> 3 
<211> 306 
<212> PRT 
<213> r 



acctataagg gagggacagg ttccagctac 960 
ggggacccca ttgtctaggt agatgtccct 1020 
gggtgattat cgacactgga tatgtccatt 1080 
cgaggaccag agtgtgggct gcaccccgag 1140 
actggaaccc tccaccttca gacctgcttc 1200 
ccgtgccctg caggcctacc tccagccgcc 1260 
tggtgttacc ctggagtcca cgggagggga 1320 
tatctggcgt caccaaaaaa aaaaaaaaaa 1380 

1387 



Claims 

1 . Method for the development of a medicament for the treatment of stroke comprising the use of a polynucleotide 
sequence encoding Cathepsin Y or homologues or fragments thereof or the according protein / polypeptide. 

2. Method according to claim 1, comprising the determination of the efficiency of a compound or agent in a model 
for stroke. 

3. Method according to claim 2, whereby the efficiency of the compound /agent is determined by considering expres- 
sion rate of the gene or by considering activity of the protein. 

4. Method according to any of claims 1 to 3, comprising: 

a) providing a test cell population comprising cells capable of expressing the Cathepsin Y gene or homologues 
or fragments thereof 

b) contacting said test cell population with the test therapeutic agent; 

c) detecting the expression of the Cathepsin Y gene(s) in said test cell population, 

d) comparing the expression of the gene(s) in the test cell population to the expression of the gene(s) in a 
reference cell population whose disease stage is known; and identifying a difference in expression levels of 
the considered sequences, if present, in the test cell population and the reference cell population, thereby 
identifying a therapeutic agent for treating stroke. 

5. Method according to any of claims 1 to 3, comprising contacting the Cathepsin Y protein with a compound /agent 
in presence of a polypeptide which is a target of the Cathepsin Y, determining the amino-terminal amino acid of 
the peptide or the amount of free amino acids in the sample after incubation, comparing the amino-terminal amino 
acid of the peptide or the amount of free amino acids with the result in a sample which doesn't contain the compound 
/ agent. 

6. Method according to claim 3 or 4, whereby the gene expression is considered or determined by PCR of cDNA, 
hybridisation of a sample DNA, or by detecting the according protein. 

7. Use of a compound down regulating Cathepsin Y expression or activity for the manufacture of a medicament for 
the treatment of stroke. 

8. Use according to claim 7, whereby the compound is an inhibitor of Cathepsin Y protein activity. 
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9. Use according to claim 8, whereby the compound has the general formula 



10 



R^XJm-Y-NR^H 



R 2 

Cu- 



0 R 3 
ii I 

CNR'CH— 



-R 4 formula I 



- 1 n 



wherein: 

15 R is selected from the group consisting of hydrogen, alkyl of from 1 to 6 carbon atoms, and where R and R2 

are joined to form a ring structure of from 4 to 1 0 carbon atoms, 

R' is selected from the group consisting hydrogen, alkyl of from 1 to 6 carbon atoms and where R' and R 3 are 
joined to form a ring structure of from 4 to 1 0 carbon atoms, 
R 1 is selected from the group consisting of 

20 alkyl of from 1 to 4 carbon atoms substituted with from 1 to 5 substituents selected from the group consisting 

of (a) aryl of from 6 to 1 0 carbon atoms, (b) aryl of from 6 to 1 0 carbon atoms substituted with 1 to 3 substituents 
selected from the group consisting of alkyl of from 1 to 6 carbon atoms, aryl of from 6 to 10 carbon atoms, 
alkoxy of from 1 to 6 carbon atoms, aryloxy of from 6 to 10 carbon atoms, hydroxy, cyano, halo and amino, 
(c) cycloalkyl of from 3 to 8 carbon atoms and (d) heterocycles of from 3 to 14 carbon atoms having from 1 to 

25 3 heteroatoms selected from the group consisting of nitrogen, oxygen and sulfur wherein said substituted alkyl 

group is optionally further substituted with from 1 to 2 hydroxyl groups, alkenyl of from 2 to 4 carbon atoms 
substituted with from 1 to 4 substituents selected from the group consisting of (a) aryl of from 6 to 10 carbon 
atoms, (b) aryl of from 6 to 10 carbon atoms substituted with 1 to 3 substituents selected from the group 
consisting of alkyl of from 1 to 6 carbon atoms, aryl of from 6 to 1 0 carbon atoms, alkoxy of from 1 to 6 carbon 

so atoms, aryloxy of from 6 to 10 carbon atoms, hydroxy, cyano, halo and amino, (c) cycloalkyl of from 3 to 8 

carbon atoms and (d) heterocycles of from 3 to 14 carbon atoms having from 1 to 3 heteroatoms selected 
from the group consisting of nitrogen, oxygen and sulfur, aryl of from 6 to 10 carbon atoms, aryl of from 6 to 
10 carbon atoms substituted with 1 to 3 substituents selected from the group consisting of alkyl of from 1 to 
6 carbon atoms, aryl of from 6 to 10 carbon atoms, alkoxy of from 1 to 6 carbon atoms, aryloxy of from 6 to 

35 1 0 carbon atoms, hydroxy, cyano, halo and amino, flucrenyl, heterocycles of from 3 to 1 4 carbon atoms having 

from 1 to 3 heteroatoms selected from the group consisting of nitrogen, oxygen and sulfur; 
R 2 and R 3 are independently D- or L-amino acid side chains of at least 2 carbon atoms with the proviso that 
said amino acid side chains do not include the proline side chain; 

R 4 is selected from the group consisting of -C(0)CH =N=N, -CH 2 OH, -C=NOH, and -C(0)R 5 
40 where R 5 is hydrogen, alkyl of from 1 to 6 carbon atoms, haloalkyl of from 1 to 6 carbon atoms and 1 to 2 halo 

groups, alkoxy of from 1 to 6 carbon atoms, -NR 6 R 7 where R 6 and R 7 are independently selected from the 
group consisting of hydrogen and alkyl of from 1 to 6 carbon atoms, and aryl of from 6 to 10 carbon atoms, 
and -N(CH 3 )OCH 3 ; 

X is selected from the group consisting of -0-, -NR 9 -, and -S- where R 9 is selected from the group consisting 
45 of hydrogen, alkyl of from 1 to 6 carbon atoms and aryl of from 6 to 10 carbon atoms; 

Y is selected from the group consisting of -C(O)- and -C(S)-: 
m is equal to zero or one; and 
n is equal to zero, one or two, 

so or pharmaceutical ly acceptable salts thereof 

with the proviso that when R 1 is 1-naphthyl, R 2 is -CH(CH 3 ) 2 (L-isomer), R 3 is -CH 2 -0 (L-isomer), Y is -C(O)-, m 
is zero and n is one, then R 4 is not -N(CH 3 )OCH 3 , with the further proviso that when R' is diphenylmethyi, 
R 2 is p-(benzyloxy)benzyl (L-isomer), Y is -C(O)-, and m and n are zero, then 

R 4 is not -N(CH 3 )OCH 3 , and with still the further proviso that when R 1 is (1 : 2diphenyl)ethenyl, Y is -C(O)-, R 2 is 
55 -CH 2 -0 (L-isomer), and m and n are zero, then R 4 is not -N(CH 3 )OCH 3 . 

10. Use according to claim 9, whereby the compound is selected from the group of 



19 



EP 1 347 059 A1 



5 



10 



15 



20 



25 



30 




Compound 1 



Compound 2 



35 



40 



45 



50 



20 



EP 1 347 059 A1 




Compound 4 



Compound 5 



45 



50 



21 




22 



EP 1 347 059 A1 




Compound 9 



Compound 10 



Compound 1 1 



50 



55 



EP 1 347 059 A1 




Compound 12 



15 




11. Method for diagnosing stroke outside of a living body or for assessing the efficacy of stroke treatment outside of 
a living body characterised by the use of a polynucleotide sequence encoding Cathepsin Y or homologues or 

30 fragments thereof or the according polypeptide. 

12. Diagnostic kit for the method according to claim 11 , whereby the kit contains one or more reagents for detecting 
one or more of the sequences selected from the group consisting of SEQ ID NO. 1 to 5 or homologues or fragments 
thereof or antibodies directed against the protein sequences. 

35 

13. Use of an isolated nucleic acid sequence comprising an "antisense" sequence compared to a nucleic acid sequence 
encoding Cathepsin Y or a fragment of such a nucleic acid sequence as a medicament. 

1 4. Method according to any of claims 1 to 6 or use according to claim 7 to 1 3 : whereby the stroke is ischemic stroke. 

40 

15. Use of a transgenic animal, wherein the gene encoding Cathepsin Y is manipulated in the animal in comparison 
to the wild type for the investigation of stroke or of compounds influencing stroke. 
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